Using confidence measures and domain knowledge to improve speech recognition

نویسندگان

  • Pascal Wiggers
  • Léon J. M. Rothkrantz
چکیده

In speech recognition domain knowledge is usually implemented by training specialized acoustic and language models. This requires large amounts of training data for the domain. When such data is not available there often still exists external knowledge, obtainable through other means, that might be used to constrain the search for likely utterances. This paper presents a number of methods to exploit such knowledge; an adaptive language model and a lattice rescoring approach based on Bayesian updating. To decide whether external knowledge is applicable a word level confidence measure is implemented. As a special case of the general problem station-to-station travel frequencies are considered to improve recognition accuracy in a train table dialog system. Experiments are described that test and compare the different techniques.

منابع مشابه

Verification of Speech Recognition Results Incorporating In-domain Confidence and Discourse Coherence Measures

Conventional confidence measures for assessing the reliability of ASR (automatic speech recognition) output are typically derived from “low-level” information which is obtained during speech recognition decoding. In contrast to these approaches, we propose a novel utterance verification framework which incorporates “high-level” knowledge sources. Specifically, we investigate two measures: in-do...

متن کامل

روشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه

Performance of speech recognition systems is greatly reduced when speech corrupted by noise. One common method for robust speech recognition systems is missing feature methods. In this way, the components in time - frequency representation of signal (Spectrogram) that present low signal to noise ratio (SNR), are tagged as missing and deleted then replaced by remained components and statistical ...

متن کامل

بهبود عملکرد سیستم بازشناسی گفتار پیوسته بوسیله ویژگی‌های استخراج شده از مانیفولدهای گفتاری در فضای بازسازی شده فاز

The design for new feature extraction methods out of the speech signal and combination of their obtained information is one of the most effective approaches to improve the performance of automatic speech recognition (ASR) system. Recent researches have been shown that the speech signal contains nonlinear and chaotic properties, but the effects of these properties are not used in the continuous ...

متن کامل

Unsupervised confidence calibration using examples of recognized words and their contexts

This paper presents a novel unsupervised calibration framework of word confidence measures for automatic speech recognition. It makes it possible to improve the quality of confidence measures in situations where the training of parametric models is hindered by a lack of human-labeled in-domain data. The proposed method calibrates confidence scores by utilizing recognition results stored in depl...

متن کامل

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003